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Abstract This paper deals with computing topological invariants such as connected com- 
ponents, boundary surface genus, and homology groups. For each input data set, we have 
designed or implemented algorithms to calculate connected components, boundary surfaces 
and their genus, and homology groups. Due to the fact that genus calculation dominates 
the entire task for 3D object in 3D space, in this paper, we mainly discuss the calculation of 
the genus. The new algorithms designed in this paper will perform: (1) pathological cases 
detection and deletion, (2) raster space to point space (dual space) transformation, (3) the 
linear time algorithm for boundary point classification, and (4) genus calculation. 

1 Introduction 

Computing topological properties for a 3D object in 3D space is an important task in image 
processing. The recent developments in medical imaging and 3D digital camera systems raise the 
problem of the direct treatment of digital 3D objects. In the past, 3D computer graphics and 
computational geometry have usually used triangulation to represent a 3D object. 

Basically, the topological properties of an object in 3D contains connected components, genus 
of its boundary surfaces, and other homologic and nomotopic properties [11] . In 3D, this problem 
of obtaining fundamental groups is decidable but no practical algorithm has yet been found. 
Therefore, homology groups have played the most significant role [9J [T2] . Research shows that 
a key factor of computing homology groups of 3D objects is the genus of the boundary surface of 
the 3D object [5]. 

Theoretical results show that there exist linear time algorithms for calculating genus and 
homology groups for 3D Objects in 3D space [SJ. However, the implementation of these algorithms 
is not simple due to the complexity of real data samplings. Most of the algorithms require the 
triangulation of the input data since it is collected discretely. However, for most medical images, 
the data was sampled consecutively, meaning that every voxel in 3D space will contain data. In 
such cases, researchers use the marching-cubes algorithm to obtain the triangulation since it is a 
linear time algorithm [18] . However the spatial requirements for such a treatment will be at least 
doubled by adding the surface-elements (sometimes called faces). 

The theoretical work of calculating genus based on simple decomposition will turn into two 
different procedures: (1) finding the boundary of a 3D object and then using polygon mapping, 
also called polygonal schema, (2) cell complex reductions where a special data structure will be 
needed. 

In this paper, we look at a set of points in 3D digital space, and our purpose is to find homology 
groups of the data set. The direct algorithm without utilizing triangulation was proposed by Chen 
and Rong in 2008 [5] However, this algorithm is based on the strict definition of digital surfaces. 
Many real 3D sets may not satisfy the definition. In other words, a set of connected points may 
not be able to be put into such a process without considerable associated theoretical and practical 
processes. 



In [B], we discuss the geometric and algebraic properties of manifolds in 3D digital spaces and 
the optimal algorithms for calculating these properties. We consider digital manifolds as defined 
in [5] . More information related to digital geometry and topology can be found in [14] and [15] . 
We presented a theoretical optimal algorithm with time complexity 0(n) to compute the genus 
and homology groups in 3D digital space, where n is the size of the input data [6]. 

The key in the algorithm in [6] is to find the genus of the closed digital surfaces that is 
the boundary of the 3D object. However, the new algorithm is based on the strict definition 
of closed digital surfaces in [5], which means that there are many cases of real sampling of 3D 
objects that do not satisfy the definition of digital surfaces. In this paper, we will also deal with 
extreme situations. We have designed an adding and deleting method to make the 3D object into 
manifolds. 

This paper provides a complete process that deals with simulated and real data in order to 
obtain the topological invariants such as connected components, boundary surface genus, and 
homology groups. 

For every input data set, we have designed or implemented algorithms to calculate connected 
components, boundary surfaces and their genus, and homology groups. This is due to the fact 
that genus calculation dominates the entire task for 3D objects in 3D space. In this paper we 
mainly discuss the calculation of the genus. The new algorithms designed in this paper will 
perform: (1) pathological cases detection and deletion, (2) raster space to point space (dual 
space) transformation, (3) the linear time algorithm for boundary point classification, and (4) 
genus calculation. 

This paper does not consider homology generators. For more details, please see [5] |20) . 

2 Concepts and Theoretical Results 

In this section, we review some existing work related to this paper including the genus of closed dig- 
ital surfaces, homology groups of manifolds in 3D digital space, and a theoretical linear algorithm 
of finding Homology Groups in 3D [6 . 

2.1 Genus of Closed Digital Surfaces 

Any continuous 3D object can be viewed as a collection of 3D voxels in digital or cubical space. 
Unless the sampling method is changed, any practical method of genus calculation must adapt to 
this fact. Medical imaging such as CT and MRI are such examples. 

Cubical space with direct adjacency, or (6,26)-connectivity in digital space[S], has the simplest 
topology in 3D digital spaces. It is also sufficient for the topological property extraction of digital 
objects in 3D. Two points are said to be adjacent in (6,26)-connectivity space if the Euclidean 
distance of these two points is 1, called direct adjacency. 

Let M be a closed (orientable) digital surface in the 3D grid space in direct adjacency. We 
know that there are exactly 6-types of digital surface points. This was first discovered by Chen 
and Zhang in [3]. Relation to different definitions of digital surfaces can be found in [4]. 

Assume that Mi (M3, M4, M5, Mg) is the set of digital points with i neighbors. We have the 
following result for a simply connected M [3] [5] : 



|M 3 | = 8+|M 5 |+2|M 6 |. 

M4 and Mq has two different types, respectively. 

The Gauss-Bonnet theorem states that if M is a closed manifold, then 



(1) 




(2) 



where dA is an element of area and Kq is the Gaussian curvature. 
Its discrete form is 
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Figure 1: Six types of digital surfaces points in 3D 



,K(p) = 2tt • (2 - 2g) 



{p is a point m m} j 

where g is the genus of M. 

Assume that Ki is the curvature of elements in Mi, i = 3,4,5,6. We have 



(3) 



Lemma 2.1 (a) K 3 = n/2, (b) K4 = 0, for both types of digital surface points, (c) K 5 = —n/2, 
and (d) K§ = —n, for both types of digital surface points. 

We obtained (see [6]), 

S = l + (|M B | + 2.|M 6 |-|M 3 |)/8. (4) 

The two examples show that the above formula is correct [6]. The first example shown in Fig. 
2, is the easiest case. 

The second example came from the Alexander horned sphere. See Fig. 3. 
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Figure 2: Simple examples of closed surfaces with g = 0, 1, 2 

2.2 Homology Groups of Manifolds in 3D Digital Space 

For a k- manifold, Homology group Hi, i = 0,...,k provides the information for the number of 
holes in each i-skeleton of the manifold. When the genus of a closed surface is obtained, we can 
then calculate the homology groups corresponding to its 3-dimensional manifold in 3D. 

The following result follows from standard results in algebraic topology [ID] , It also appears 
in [9]. Let bi = rankf/^ (M, Z) be the zth Betti number of M. The Euler characteristic of M is 
defined by 



3 



ft 



Cb) 



Figure 3: An example came from Alexander horned sphere in digital space 



X (M) = ^(-ir& 4 

i>0 

If M is a 3-dimensional manifold, Hi(M) = for all i > 3 essentially because there are no 
i-dimensional holes. Therefore, x(Af ) = b a — b\ + 62 — b%. Furthermore, if M is in R 3 , it must 
have nonempty boundary. This implies that 63 = 0. 

Theorem 2.2 Let M be a compact connected 3-manifold in S 3 . Then 

(a) H (M) = Z. 

(b) Hi(M) = zi bl ( dAI \ i.e. H\(M) is torsion-free with rank being half of rank H\{dM). 

(c) H2(M) = Z n ~ x where n is the number of components of dM . 

(d) H 3 (M) = unless M = S 3 . 

A proof of above theorem is shown in [6] . 

2.3 A Theoretical Linear Algorithm of Finding Homology Groups in 3D 

Based on the results we presented in the above subsections, we now describe a linear algorithm 
for computing the homology group of 3D objects in 3D digital space [6]. 

Assuming we only have a set of points in 3D. We can digitize this set into 3D digital spaces. 
There are two ways of doing so: (1) by treating each point as a cube-unit that is called the raster 
space, (2) by treating each point as a grid point, which is also called the point space. These two 
are dual spaces. Using the algorithm described in 0, we can determine whether the digitized set 
forms a 3D manifold in 3D space in direct adjacency for connectivity. The algorithm is in linear 
time. 

Algorithm 2.1 Let us assume that we have a connected M that is a 3D digital manifold in 3D. 

Step 1. Track the boundary of M, dM, which is a union of several closed surfaces. This algorithm 
only needs to scan though all the points in M to see if the point is linked to a point outside 
of M. That point will be on boundary. 
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Step 2. Calculate the genus of each closed surface in dM using the method described in Section 
2. We just need to count the number of neighbors on a surface, and put them in Mi, using 
the formula (5) to obtain g. 

Step 3. Using the Theorem 12. 21 we can get Ho, Hi, H^, and H3. Ho is Z. For H±, we need to 
get b\ (dM) that is just the summation of the genus in all connected components in dM. 
(See [10] and (jJJ.) Hi is the number of components in dM. H3 is trivial. 

Lemma 2.3 Algorithm 2.1 is a linear time algorithm. 

Therefore, we can use linear time algorithms to calculate g and all homology groups for digital 
manifolds in 3D based on Lemma 2.2 and Lemma 4.1. 

Theorem 2.4 There is a linear time algorithm to calculate all homology groups for each type of 
manifold in 3D. 

3 Algorithm Design and Implementation 

The algorithm described in Section 2 is a theoretical result. The implementation of the algorithm 
must consider all possible cases in practical data collection. We first need to find the boundary 
and then decide if the boundary is a 2D manifold. If the boundary data connecting voxel data 
sets are not purely defined digital surfaces, we will have three options: (1) we need to modify the 
data to meet the requirement before genus calculation, (2) if the change of the original data set is 
too great, we may need to stop the modification instead of outputting a result for reference, and 
(3) we make some limited changes, and then produce a result. 

The difference between the theoretical results and practical data processing is that we may 
not always get the input data we expected. In our case, the boundary of a solid object should be 
treated as a surface. However, practically, this might not always be the case. Some researchers also 
consider making real data sets "well" -organized. Siqueira et al considered making a 26-connected 
data set well-composed [21] [22] CQ- This means that two voxels will be connected by a sequence 
of voxels where each pair of two adjacent cubes share a 2D-cell (face-unit). The concept of well- 
composed is mathematically equivalent to 6-connected. An algorithm described in [21] [22] may 
generate new "none" well-composed cases, which are not good selections for genus calculation. 

Our new algorithm and implementation will perform: (1) pathological cases detection and 
deletion, (2) raster space to point space (dual space) transformation, (3) the linear time algorithm 
for boundary point classification, and (4) genus calculation. 

Some detailed considerations of recognition algorithms related to 3D manifolds can be found in 
[2] where Brimkov and Klette made extensive investigations in boundary tracking. The discussions 
of 3D objects in raster space can be found in [T7] , 

3.1 Input Data Sets 

This subsection will discuss the input data formats. We will focus on cubical data, for instance 
MM and CT data. In cubical data samples, we assume that the sampling is contiguous, where each 
sample point is normally followed by another sample point in its neighborhood. It is important 
to know this because a random sampling can cause the problem of uncertainty. In this case, we 
usually cannot calculate the genus without making an assumption. For instance, we will not be 
able to know where a hole is. In order to get simplicial decomposition (usually triangulation), we 
usually need to use Voronoi or Delaunay decomposition with boundary information. That means 
the boundary must be assumed. 

A new technology is called persistent homology analysis that tells us how to find the best esti- 
mation for the location of holes, usually by multiscaling (the upscaling and downscaling methods). 
However, this method is not a precise analysis [7] [2"5] . 

Even though, our method can be modified to be used in persistent analysis, this paper mainly 
deals with the method of precise genus and homology group calculation. 
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In summary, our assumption is that the digital object consists of cubical points (digital points, 
raster points). Each point is a cube, which is the smallest 3D object. The edge and point are 
defined with regards to the cube and an object may contain several connected components using 
a cube-linking path. Our purpose again is to calculate the topological properties of the object, or 
of each component, essentially. 

3.2 Searching connected components of a cubical data set 

Connected component search is an old task that can be done by using Tarjan's Breadth-first- 
search. Pavlidis was one of the first people to realize and use this algorithm in image processing. 
This problem is also known as the labeling problem. The complexity of the algorithm is 0(n) 

EM- 

The problem is what connectivity is based off of. In 3D, we usually have 6-, 18-, 26- connectivity. 
Since real data has noise, we have to consider all of those connectivities. So we must use 26- 
connectivety to get the connected components. 

Therefore, the connected component of the real processing is not a strictly 6-connected com- 
ponent. The topological theorem generated previously in [B] is no longer suitable. So we need 
to transform a 26-connected component into a 6-connected component. This should be done by 
a meaningful adding or deleting process since optimization on the minimum number of changes 
could be an NP-hard problem. 

Problem of minimum modifications: Given a set of points in 3D digital space, if this set 
is not a manifold, assume that the points are connected in a connectivity defined using adding or 
deleting processes to make the set a 3D manifold. The question becomes: is there a polynomial 
algorithm that makes the solution have minimum modifications where adding or deleting a data 
point will be counted as one modification? 

A similar problem was considered in [21] in which a decision problem of adding was proposed. 

This problem can be extended to a general fc-manifold in n-D space. Even though we have the 
6-connected component, there may still be cases that contain the pathological situation, which 
needs special treatment. We will discuss this issue in the next subsection. 

3.3 Pathological Cases Detection and Deletion 

In this paper, we only deal with the Jordan manifolds, meaning that a closed (n — l)-manifold 
will separate the n-manifold into two or more components. For such a case, only direct adjacency 
will be allowed since indirect adjacency will not generate Jordan cases. 

That is to say, if the set contains indirect adjacent voxels, we need to design an algorithm to 
detect the situation and delete some voxels in order to preserve the homology groups. 

It is known that there are only two such cases in cubical or digital space [5] : two voxels (3-cells) 
share a 0-cell or a 1-cell. Therefore, we want to modify the voxel set to only contain voxels where 
two of these cases do not appear. Two voxels share exactly a 2-cell, or there is a local path (in the 
neighborhood) of voxels where two adjacent voxels share a 2-cell [5]. A special case was found in 
22J that is the complement case of the case in which two voxels share a 0-cell (see Fig 4. (a)). 
The case may create a tunnel or could be filled. We will simplify it by adding a voxel in a 2 x 2 x 2 
cube. Such a case in point space is similar to the case (a) in Fig 4 since the boundaries of these 
cases are the same. 

The problem is that many real data sets do not satisfy the above restrictions (also called well 
composed image) . The detection is easy but deleting certain points (the minimum points deletion) 
to preserve the homology is a bigger issue. 

The following rules (observations) are reasonable: In a neighborhood N2j(p) that contains 8 
cubes and 27 grid points, 

a) if a voxel only shares a 0-cell with a voxel. This voxel can be deleted. 

b) if a voxel only shares a 1-cell with a voxel. This voxel can be deleted. 
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c) if a boundary voxel v shares a 0,1-cell with a voxel, assume v also shares a 2-cell with a 
voxel u, u must share a 0,1-cell with a voxel that is not in the object M. u is on the boundary. 
Deleting v will not change the topological properties. 

d) if in a 2 x 2 x 2 cube, there are 6 boundary voxels and its complement (two zero-valued 
voxels) is the case (a) in Fig (4) . Add a voxel to this 2x2x2 cube such that the new voxel shares 
as many 2-cells in the set as possible. This means that we want the adding voxel to be inside of 
the object as much as we can. 



Figure 4: Pathological Cases 

In this paper, we implement or modify the above rules to fit the theoretical definition of the 
digital surfaces. We also design an algorithm based on these rules to detect and delete some data 
points while preserving the topology. This is essential to calculating the genus correctly. However, 
when the object becomes more complex, pathological situations may still exist. 

The mathematical foundation of this above process that eliminates pathological cases is still 
under investigation. 

Mathematical foundation of Modifying a 3D object to be a 3D manifold: Given a 
set of points in 3D digital space, how would we modify the data set into a manifold without losing 
or changing the topology (in mathematics)? 

3.4 Boundary Search 

In general, a point is on the boundary if and only if it is adjacent to one point in the object and 
one point not in the object (in 26-connectivity). A simple algorithm that goes through each point 
and tests the neighborhood will determine whether a point is on the boundary or not. This is a 
linear time and 0(log{n)) space algorithm. 

The only thing special is that we use 26-connectivity to determine the boundary points. This 
is to take all possible boundary points into consideration in the next step. 

3.5 Determination of the Configuration of Boundary Points 

When all boundary points are found, we need to find their classifications. In other words, we need 
to determine whether a special point is in M3, M4, M5, or Mq. Here is the problem: if we only 
have one voxel, is it a point (0-cell) or a 3D object (3-cell)? In this paper, we treat it as a 3-cell. 
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M3= 1276 ; M4= 2974 ; M5= 
876; M6=204 

Boundary Qenus= 1 

totaINoneM5 0. totaENoneM6 



manifold Genus= 2 



M3= 2268 ; M4= 5456 ; M5= 1688 ; 
M6=301 

tota£NoneM5 2, totaiNoneM6 2 

No result since 4 pathological 
situations are included in the objects. 

Must delete those points in order to 
calculate the genus correctly. 



Figure 5: Without Pathological Case Process 

The input data is in raster space, but the boundary surface will be in point space. We must 
first make the translation. Then, for each point on the surface, we count how many neighbors 
exist in order to determine its configuration category. After that, we use formula (4) to get the 
genus. 

If we still need to find homology groups, we can just use the simple calculations based on 
Theorem 2.3 to obtain them. Using the program, we get the genus= 6 for a modified real image 
(Fig. 7). 




;=5; 

surface points=8950 



Figure 6: After Pathological Case Process 




Figure 7: A Modified Real Image 
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